Approximating Semi-Matchings in Streaming and in Two-Party 

Communication 

Christian Konrad^ and Adi Rosen^ 

^ LIAFA, Universite Paris Diderot, France. 

konradSlri . f r 

^ CNRS and Univerite Paris Diderot - Paris 7, France. 

adiroOliaf a.univ-paris-diderot .fr 



Abstract. We study the communication complexity and streaming complexity of approximating 
unweighted semi-matchings. A semi-matching in a bipartite graph G = [A, B, E), with n = \A\, is 
a subset of edges S (^ E that matches all A vertices to B vertices with the goal usually being to 
do this as fairly as possible. While the term semi-matching was coined in 2003 by Harvey et al. 
fT^ [WADS 2003], the problem had already previously been studied in the scheduling literature under 

T-H different names. 

^^ We present a deterministic one-pass streaming algorithm that for any < e < 1 uses space 0(n^^'^) 

Cn and computes an 0(n''^~'^'")-approximation to the semi-matching problem. Furthermore, with 

^ O(logn) passes it is possible to compute an 0(logn)-approximation with space 0(n). 

Cah In the one-way two-party communication setting, we show that for every e > 0, deterministic 

■^U( communication protocols for computing an 0(n<i+'''^+i )-approximation require a message of size 

iy~, more than cn bits. We present two deterministic protocols communicating n and 2n edges that 

^vj compute an O(v'n) and an 0(n'^'^)-approximation respectively. 

Finally, we improve on results of Harvey et al. [Journal of Algorithms 2006] and prove new links be- 
I I tween semi-matchings and matchings. While it was known that an optimal semi-matching contains 

^^ a maximum matching, we show that there is a hierachical decomposition of an optimal semi- 

I I matching into maximum matchings. A similar result holds for semi-matchings that do not admit 
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length-two degree-minimizing paths. 



I 1 Introduction 

> 

\^ Semi-Matchings. A matching in an unweighted bipartite graph G = {A, B, E) can be seen as a one-to- 

^^ one assignment matching the A vertices to B vertices. The usual aim is to find a matching that leaves as 

few A vertices without associations as possible. A semi-matching is then an extension of a matching, in 
that it is required that all A vertices are matched to B vertices. This, however, is generally not possible 

'^ in an injective way, and therefore we now allow the matching of multiple A vertices to the same B vertex. 

Typical objectives here are to minimize the maximal number of A vertices that are matched to the same 
B vertex, or to optimize with respect to even stronger balancing constraints. The term 'semi-matching' 
l" was coined by |HLLT03| and also used in [FLN10| . however, the problem had already previously been 

. ^ intensely studied in the scheduling literature |ECS73IHor73IANR95IAbr03ILL04] . We stick to this term 

ji^ since it nicely reflects the structural property of entirely matching one bipartition of the graph. 

JH The most prominent application of the semi-matching problem is that of assigning a set of unit- 

length jobs to a set of identical machines with respect to assignment conditions expressed through edges 
between the two sets. The objective of minimizing the maximal number of jobs that a machine receives 
then corresponds to minimizing the makespan of the scheduling problem. Optimizing the cost function 
^^g^ deg5(6)(deg5(6) -|- l)/2, where degg{b) denotes the number of jobs that a machine b receives in 
the semi-matching S, corresponds to minimizing the total completion time of the jobs (optimizing with 
respect to this cost function automatically minimizes the maximal degree). 

It is well known that matchings are of maximal size if they do not admit augmenting paths |Ber57| . 
Augmenting paths for matchings correspond to degree-minim,izing paths for semi-matchings. They first 
appeared in |HLLT03] under the name of cost-reducing-paths, and they were used for the computation of 
a semi-matching that minimizes a certain cost function. We use the term 'degree-minimizing-path' since 
it is more appropriate in our setting. A degree-minimizing path starts at a B node of high degree, then 
alternates between edges of the semi-matching and edges outside the semi-matching, and ends at another 



B node of degree at least by two smaller than the degree of the starting point of the path. Flipping the 
semi-matching and non-semi-matching edges of the path then generates a new semi-matching such that 
the large degree of the start node of the path is decreased by 1, and the small degree of the end node 
of the path is increased by 1. An optimal semi-matching is defined in |HLLT03] to be one that does not 
admit any degree-minimizing paths. It was shown in [HLLT03] that such a semi-matching is also optimal 
with respect to a large set of cost functions, including the minimization of the maximal degree as well 
as the minimization of the total completion time. At present, the best existing algorithm for computing 
an optimal semi-matching [FLNlOj runs in time 0(-\/|y||-E| log \V\) where V = AU B. See |FLN10j 
for a broader overview about previous work on semi-matchings (including works from the scheduling 
literature). 

In this paper, we study approximation algorithm,s for the semi-matching problem in different compu- 
tational settings. The notion of approximation that we consider is with respect to the maximal degree: 
given a bipartite graph G = {A,B,E) with n = \A\, we are interested in computing a semi-matching S 
such that deg max S < c- deg max S* , where S* denotes an optimal semi-matching, deg max denotes the 
maximal degree of a vertex w.r.t. a set of edges, and c is the approximation factor. This notion of ap- 
proximation corresponds to approximating the makespan when the semi-matching is seen as a scheduling 
problem. This setting was already studied in e.g. |ANR95j . 

Streaming Algorithms and Communication Complexity. Streaming Algorithms fall into the 
category of massive data set algorithms. In many applications, the data that an algorithm is called upon 
to process is too large to fit into the computer's memory. In order to cope with this problem, a streaming 
algorithm sequentially scans the input while using a random access memory of size sublinear in the length 
of the input stream. Multiple passes often help to further decrease the size of the random access memory. 
Graph streams are widely studied in the streaming model, and in the last years matching problems have 
received particular attention |AGlllGKK12IKMM12|Kapl3| . A graph stream is a sequence of the edges 
of the input graph with a priori no assumption on the order of the edges. Particular arrival orders of the 
edges are studied in the literature and allow the design of algorithms that depend on that order. Besides 
uniform random order JKMM12] , the vertex arrival order |GKK12|Kapl3| of edges of a bipartite graph 
is studied where edges incident to the same A node arrive in blocks. Deciding basic graph properties 
such as connectivity already requires i7(|y|) space [FKM+OS] . where V denotes the vertex set of a graph. 
Many works considering graph streams allow an algorithm to use 0(|T^| polylog \V\) space. This setting 
is usually called the semi- streaming setting. 

Space lower bounds for streaming algorithms are often obtained via Gommunication Gomplexity. 
There is an inherent link between streaming algorithms and one-way fc-party communication protocols. 
A streaming algorithm for a problem P with space s also serves as a one-way fc-party communication 
protocol for P with communication cost O(sfc). Conversely, a lower bound on the size of any message of 
such a protocol is also a lower bound on the space requirements of a streaming algorithm. Determining the 
communication complexity of problems is in itself an important task, however, the previously discussed 
link to streaming algorithms provides an additional motivation. 

Our Contributions. We initiate the study of the semi-matching problem in the streaming and 
the communication settings. We present a deterministic one-pass streaming algorithm that for any 
< e < 1 uses space 0(ri^+'^) and computes an 0(n(^^'^/^) approximation to the semi-matching problem 
(Theorem lljp| Furthermore, we show that with O(logn) passes we can compute an O(logn) approxi- 
mation with space 0(n) (Theorem p|. 

In the two-party one-way communication setting, we show that for any e > 0, deterministic commu- 

1 
nication protocols that compute an 0(n(i+^)'=+i ) approximation to the semi-matching problem require a 

message of size at least en bits (Theorem Isl). We present two deterministic protocols communicating 

n and 2?! edges that compute an O(y^) approximation and an 0{n^'^) approximation, respectively 

(Theorem [3]). 

While it was known that an optimal semi-matching contains a maximum matching |HLLT03] . we 

show that there is a hierarchical decomposition of an optimal semi-matching into maximum matchings 



(Lemma 14 ) . Similarly, we show that semi-matchings that do not admit length-two degree-minimizing 



paths can be decomposed into maximal matchings (Lemma 13). The latter result allows us to prove 



We write 0(n) to denote 0(n polylog n) 



that the maximal degree of a sem.i-m.atching that does not admit a length-two degree-minimizing path 
is at most [log(n + 1)] times the maximal degree of an optimal semi-matching (Theorem [6]) . 

A semi-streaming algorithm for vertex arrival order. In |ANR95| , the semi-matching problem 
is studied in the online model (seen as a scheduling problem). In this model, the A vertices arrive online 
together with their incident edges, and it has to be irrevocably decided to which B node an A node is 
matched. It is shown that the greedy algorithm matching an A node to the B node that currently has 
the smallest degree is [log(n -I- 1)] competitive, and that this result is tight. This algorithm can also 
be seen as a one-pass [log(n -f 1)] approximation semi-streaming algorithm (meaning 0(n) space) for 
the semi-matching problem when the input stream is in vertex arrival order. Note that our one-pass 
algorithm does not assume any order on the input sequence, and when allowing 0(n) space it achieves 
an 0(v^)-approximation. 

Techniques. Our streaming algorithms are based on the following greedy algorithm. Fix a maximal 
degree dmax (for instance dmax = n^^^) and greedily add edges to a set S'l such that the maximal degree 
of a B node in S'l does not exceed dmax, and the degree of any A node in Si is at most 1. This algorithm 
leaves at most ©(n/dmax) A vertices unmatched in Si. To match the yet unmatched vertices, we use a 
second greedy algorithm that we run in parallel to the first one. We fix a parameter d' appropriately (if 
dmax = n^^'^ then we set d' — n^^^) and for all vertices a G ^4 we store arbitrary d' edges incident to a 
in a set E' . Then, we compute an optimal semi-matching S2 of the unmatched vertices in Si and the 
B nodes only considering the edges in E' . We prove that such a semi-matching has bounded maximal 
degree (if dmax — n^^'^ and d' — •n^l'^ then this degree is n}-^'^'). The set S'l U S'2 is hence a semi-matching 
of maximal degree dmax + deg max S2 and the space requirement of this algorithm is 0{nd'). In Section^ 
we generalize this idea for any < e < 1 to obtain one-pass algorithms with approximation factors 
0{n^^'^^^~'^^) using space 0(n^"'"'^), and a log(n)-pass algorithm with approximation factor O(logn) using 
space 0(n). 

In the two-party one-way communication setting, the edge set _E of a bipartite graph G = {A, B, E) is 
split among two players, Alice and Bob. Alice sends a message to Bob and Bob outputs a semi-matching 
of G. Our communication upper bounds make use of what we call a c- semi-matching skeleton (or simply 
c-skeleton). A c-skeleton of a bipartite graph G = {A, B, E) is a subset of edges S C E such that for any 
A' C A : deg max semi (^', _B, S) < c- deg max seTai{ A' ,B,E) where seYni{ A' ,B,E') denotes an optimal 
semi- matching between A' and B using edges in E' . We show that if Alice sends a c-skeleton S of 
her subgraph to Bob, and Bob computes an optimal semi-matching using his edges and the skeleton, 
then the resulting semi-matching is a c -|- 1 approximation. We show that there is an 0(-\/n)-skeleton 
consisting of n edges, and that there is an 0(n"'^'^)-skeleton consisting of 2n edges. It turns out that an 
optimal semi-matching is an 0(-v/n)-skeleton, and we show how an 0(n"'^/^)-skeleton can be improved to 
an 0(n^/'^)-skeleton by adding additional n edges. These skeletons are almost optimal: we show that for 
any e > 0, an 0(n(i+^)^+T)-skeleton has at least en edges. Inspired by the prior lower bound, we prove 
that for any e > 0, the deterministic one-way two-party communication complexity of approximating 
semi-matchings within a factor 0{n'-'-+''>''+^ ) is at least en bits. 

In order to prove our structure lemmas on semi-matchings, we make use of degree-minimizing paths. 
Our results on the decomposition of semi-matchings into maximum and maximal matchings directly 
relate the absence of degree-minimizing paths to the absence of augmenting paths in matchings. See 
Section [5] for details. 

Organization. After presenting notations and definitions in Section [2] we present our streaming 
algorithms in Section [3J We then discuss the one-way two-party communication setting in Section |4J We 
conclude with Section [51 where we present our results on the structure of semi-matchings. 

2 Notations and Definitions 

Let G ~ {A, B, E) be a bipartite graph and let n = \A\. For ease of presentation, we assume that \B\ is 
upper-bounded by a polynomial in n. Let e G £■ be an edge connecting nodes a £ A and b G B. Then, 
we write A{e) to denote the vertex a, B{e) to denote the vertex b, and ab to denote e. Furthermore, for 
a subset E' C E, we define A{E') = {J^^^, A{e) (respectively B{E')). For subsets A' C A and B' C B 
we write E'\a'xB' to denote the subset of edges of E' whose endpoints are all in A' U B' . We denote by 



E'{a) the set of edges oi E' C E that have an endpoint in vertex a, and E'{A') the set of edges that 
have endpoints in vertices of A' , where A' C A (similarly we define E'{B') for B' C B). 

For a node v € AVJ B^ the neighborhood of v is the set of nodes that are connected to v and we 
denote it by r{v). For a subset E' C E, we write E^'iv) to denote the neighborhood of v in the graph 
induced by E' . Note that by this definition r{v) = /^(w). For a subset E' C £■, we denote by deg^t{v) 
the degree in ii^' of a node v € V, which is the number of edges of E' with an endpoint in v. We define 
degmaxi?' := max^jg^us deg£,(w). 

Matchings. A matching is a subset MCE such that \/v G AU B : degJ^J{v) < 1. A maximal 
matching is a matching that is inclusion-wise maximal, i.e. it can not be enlarged by adding another 
edge of E to it. A maximum matching is a matching of maximal size. A length p augmenting path [p > 3, 
p odd) wrt. a matching M is a path P — {vi,.. . ,Wp+i) such that vi,Vp+i ^ A{AI) U B{M) and for 
i < l/2{p - 1) : V2iV2i+i e M, and V2i-iV2i ^ M. 

Semi-Matchings. A semi-matching of G is a subset S C E such that Va E A : deg5(a) = 1. 
A degree- minimizing path P = (6i,ai, . . . , 6fc_i, a/j-i, 6^) with respect to a semi-matching S" is a path 
of length 2k {k > 1) such that for a\\ i < k : {ai,bi) € S, for all i < k — 1 : (ai,6i+i) ^ S, and 
deg5(6i) > deg5(62) > deg5'(63) > • • • > deg(fefc_i) > deg(5fc). An optimal semi-matching S* C E is a 
semi-matching that does not admit any degree-minimizing-paths. For subsets A' C A,B' C B,E' C E, 
we denote by sem.i{A' , B' , E') an optimal semi-matching in the graph G' = {A',B' ,E'), and we denote 
by semi2(A', B', E') a semi-matching that does not admit degree-minimizing paths of length 2 in G' . 

Incomplete d-bounded Semi-Matchings. Let d be an integer. Then an incomplete d-bounded 
semi-matching of G is a subset S C E such that Va G A : degg(a) < 1 and ^b € B : degg{b) < d. 
For subsets A' C A,B' C B,E' C E, we write iseinid{A' , B' , E') to denote an incomplete d-bounded 
semi- matching of maximal size in the graph G' — {A' , B' ,E'). 

Approximation. We say that an algorithm (or communication protocol) is a c-approximation al- 
gorithm (resp. communication protocol) to the semi-matching problem if it outputs a semi-matching 5* 
such that deg max S < c- deg max S* , where S* denotes an optimal semi-matching. We note that this 
measure was previously used for approximating semi-matching, e.g, in |ANR95J . 

3 Streaming Algorithms 

To present our streaming algorithms, we describe an algorithm, ASEMi{G, s,d,p) (Algorithm [ij , that 
computes an incomplete 2cij)-bounded semi-matching in the graph G using space 0(s), and makes at 
most p > 1 passes over the input stream. If appropriate parameters are chosen, then the output is not 
only an incomplete semi-matching, but also a semi-matching. We run multiple copies of this algorithm 
with different parameters in parallel in order to obtain a one-pass algorithm for the semi-matching 
problem (Theorem [1]) . Using other parameters, we also obtain a log(n)-pass algorithm, as stated in 
Theorem m 

Algorithm 1 Skeleton for approximating semi-matchings: ASEMi{G, s,d,p) 
Require: G = (A, B, E) is a bipartite grapli 

repeat at most p times or until |^(5)| = \A\ 

5 <- 5 U incomplete(G|(a\a(S))xs, «, d) 
end repeat 
return S 



ASEMi(G, s, d,p) starts with an empty incomplete semi-matching S and adds edges to S by invoking 
incomplete(G, s, d) (Algorithm [2]) on the subgraph of the as yet unmatched A vertices in S and all 
B vertices. Each invocation of incomplete(G, s, d) makes one pass over the input stream and returns 
a 2(i-bounded incomplete semi-matching while using space 0(s). Since we make at most p passes, the 
resulting incomplete semi-matching has a maximal degree of at most 2dp. 

incomplete(G, s, d) collects edges greedily from graph G and puts them into an incomplete d- 
bounded semi-matching 5*1 and a set E' . An edge e from the input stream is put into Si if Si U {e} is 



Algorithm 2 Computing incomplete semi-matchings: incomplete(G, s, d) 
Require: G = {A, B, E) is a bipartite graph 
k -ir- s/\A\, Si ^ 0, £;' <- 
while 3 an edge ab in stream do 

if ab ^ A X B then continue 

if degg^ (a) = and degg^ (fo) < d then Si <— Si U {ab} 

if degg, (a) < k then E' ^ E' U {ab} 
end while 

S2 <- isemid(-E'|(A\A(Si))xs) 
S ^ Si U S2 
return S 



still an incomplete d-bounded semi-matching. An edge e = a6 is added to E' if the degree of a in E'U {e} 
is less or equal to a parameter k which is chosen to be s/\A\ in order to ensure that the algorithm does 
not exceed space 0(s). The algorithm returns an incomplete 2(i-bounded semi-matching that consists of 
Si and S2 , where S2 is an optimal incomplete d-bounded semi-matching between the A vertices that are 
not matched in 6*1 and all B vertices, using only edges in E' . 

We lower-bound the size of S2 in Lemmafll We prove that for any bipartite graph G — {A, B, E) and 
any A: > 0, if we store for each a e A any max{fc, degQ{a)} incident edges to a, then we can compute an 
incomplete d-bounded semi-matching of size at least min{fc(i, |A|} using only those edges, where d is an 
upper-bound on the maximal degree of an optimal semi-matching between A and B in G. 

Lemma IT] is then used in the proof of Lemma |2l where we show a lower bound on the size of the 
output Si U 5*2 of incomplete(G, s, d). 

Lemma 1. Let G = {A,B,E) be a bipartite graph, let k > and let d > degmaxsemi(A, i?, £'). 
Furthermore, let E' Q E be a subset 0/ edges such that for all a ^ A : deg^, (a) — min{fc, deg^(a)}. Then 
there is an incomplete d-bounded semi- matching S ^ E' such that \S\ > minjfcd, \A\}. 

Proof. Let d* — degmaxsemi(yl, _B,_E). We explicitly construct an incomplete semi-matching S. Let 
Aq C A such that for all a ^ Aq : deg^,{a) = deg^(a), and let Ai ~ A\ Aq. Let 5*0 = semi(ylo, B, E). 
Clearly, degmaxS'o < d* . We construct now S as follows. 

Start with S = Sq, and then add greedily edges in any order from -B'Iaixb to S such that S remains 
an incomplete semi-matching with maximal degree d. Stop as soon as there is no further edge that can 
be added to S. 

We prove that S contains at least ram{kd, \A\} edges. To see this, either all nodes of A are matched 
in iS", or there is at least one node a G Ai that is not matched in S (note that all nodes in Aq are matched 
in S). Since deg^, (a) = k, all nodes b e /£■/(«) have degree d since otherwise a would have been added 
to S. This implies that there are at least k ■ d nodes matched in S which proves the lemma. D 

Lemma 2. Let G ~ {A,B,E) be a bipartite graph, let s > \A\ and let d > degmaxsemi(A, i?, £'). 
Then INCOMPLETe(G, s, d) (see Algorithm L2[) uses 0(s) space and outputs an incomplete 2d-bounded 
semi-matching S such that \S\ > min{|^| ^ , ^. -|- -[-jr, \A\}. 

Proof. The proof refers to the variables of Algorithm [2] and the values they take at the end of the 
algorithm. Furthermore, let S* — semi{A, B , E) , d* = degmaxS**, and let A' — A\ A{Si). 

Firstly, we lower-bound IS*!]. Let a G A' and b = S*{a). Then deg5.^(6) = d since otherwise a 
would have been matched in Si. Hence, we obtain |v4(S'i)| > d\B{S*{A'))\ > d\A'\/d* , where the second 
inequality holds since the maximal degree in S* is d* . Furthermore, since A' = A\ A{Si) and \Si\ = 
|^(5'i)|, we obtain \Si\ > \A\-^j^^. We apply Lemmalllon the graph induced by the edge set E'\a'xb- 
We obtain that |S'2| > min{ds/\A\, \A'\} and consequently \S\ = \Si\-\- \S2\ > min{|^|^q^ -\- |f|, \A\}. 

Concerning space, the dominating factor is the storage space for the at most A; -1-1 edges per A vertex, 
and hence space is bounded by 0(fc|^|) = 0(s). D 

In the proof of Theoremlll for < e < 1 we show that ASEMi(G, ri^+'^, n^^'^^^~'^^d' , 1) returns a semi- 
matching if d' is at least the maximal degree of an optimal semi-matching. Using a standard technique, 
we run log(n) -I- 1 copies of ASEMI for all d' — 2* with < i < log(n) and we return the best semi- 
matching, obtaining a 1-pass algorithm. We use the same idea in Theorem [2] where we obtain a 41og(n) 
approximation algorithm that makes log(n) passes and uses space 0{n). 



Theorem 1. Let G = {A,B,E) be a bipartite graph with n — \A\. For any < e < 1 there is a one-pass 
streaming algorithm using 0{n^'^'^) space that computes a An^''^^^^^ approximation to the semi-matching 
problem. 

Proof. We run log(n) + 1 copies of Algorithm IT] in paraUel as fohows. For < i < [log(n)] let Si — 
ASEMi(G', n^'*''^, n^/^'^^~'^)2*, 1) and choose among the Si a semi-matching Sk such that |5fc| = n and for 
any other Si with \Si\ = n : degmax^fe < degmaxS*;. 

We show now that there is a Sj which is a semi-matching that fulfills the desired approximation 
guarantee. Let 5* = semi(A, _B,_E) and d* = degmax(S*). Then define j to be such that d* < 2^ < 2d* 
and let d = v}l'^'^^~^^2K Sj is the output of a caU to iNCOMPLETE(G,r7,^+% d). By Lemma pj Sj is of 
size at least minjn ^_^^, -I- dn'^, \A\} which equals \A\ for our choice of d. This proves that all a S A are 

matched in Sj. By Lemma ^ degmaxSj < 2d which is less or equal to 4n^/^(^~')(i*. Hence, Sj is a 
4^i/2(i-e) approximation. 

The space requirement is log n times the space requirement for the computation of a single Si which 
is dominated by the space requirements of Algorithm bl By Lemma bl this is 0(n^+^), and hence the 
algorithm requires O(n^+^logri) — 0(ri^+'^) space. D 

Theorem 2. Let G — {A,B,E) be a bipartite graph with n ^ \A\. There is a log{n)-pass streaming 
algorithm using space 0{n) that computes a 41og(n) approximation to the semi-matching problem. 

Proof. As in the proof of Theorem [11 we run log(n) -I- 1 copies of Algorithm IT] in parallel. For < i < 
[log(n)] let Si = ASEMi(G', n, 2% log(ri)) and choose among the 5^ a semi-matching Sk such that \Sk\ = "■ 
and for any other Si with \Si\ — n : deg max Sk < deg max 5; . 

We show now that there is a 5*^ which is a semi-matching that fulfills the desired approximation 
guarantee. Let S* = sem.[{A,B,E) and d* = degmax(S'*). Then define j to be such that d* < 2^ < 2d* 
and let d — 2^. Sj is the output of a call to ASEMi(G,n, (i,log(n)). In each iteration, the algorithm 
calls iNCOMPLETE(G",rT;, d), where G" is the subgraph of G of the not yet matched A vertices and the 
B vertices. By Lemma 2 at least a ji^ > 1/2 fraction of the unmatched A vertices is matched since 
d > d* , and the maximal degree of the incomplete semi-matching returned by INCOMPLETe(G', n, d) is 
at most 2d. Hence, after log(n) iterations, all A vertices are matched. Since d < 2d* and the algorithm 
performs at most log(n) iterations, the algorithm returns a 41og(n) approximation. 

Each copy of Algorithm [ij uses space 0(n) and since we run O(logn) the required space is 0(n). D 

4 Two-party Communication Complexity 

We now consider one-way two-party protocols which are given a bipartite graph G = {A, B, E) as input, 
such that El Q E \s given to Alice and E2 Q E \s given to Bob. Alice sends a single message to Bob, 
and Bob outputs a valid semi-matching S for G. A central idea for our upper and lower bounds is what 
we call a c- semi-matching skeleton (or c-skeleton). Given a bipartite graph G = {A,B,E), we define a 
c-semi-matching skeleton to be a subset of edges S C E such that \/A' C A : deg max semi ( A', B, S") < 
c • degmaxsemi(A', J3, i;^). We show how to construct an 0(-yn)-skeleton of size n, and an 0(n^/'^)- 
skeleton of size 2n. We show that if Alice sends a c-skeleton of her subgraph G = {A,B,Ei) to Bob, 
then Bob can output a c+ 1-approximation to the semi-matching problem. Using our skeletons, we thus 
obtain one-way two party communication protocols for the semi-matching problem with approximation 
factors 0(-y/n) and 0(n^/'^), respectively (Theoremp|. Then we show that for any e > 0, an 0(ri<i+"''^+i )- 
skeleton requires at least en edges. This renders our 0(-v/n)-skeleton and our 0(ri^/^)-skeleton tight up 
to a constant. 

4.1 Upper Bound 

Firstly, we discuss the construction of two skeletons. In LemmalSl we show that an optimal semi-matching 
is an 0(-v/n)-skeleton, and in Lemma^ we show how to obtain a 0(n^/'^)-skeleton. In these constructions, 
we use the following key observation: Given a bipartite graph G — {A, B, E), let A' C Ahe such that A' 
has minimal expansion, meaning that A' = argmin^//^^ ' \,n . The maximal degree in a semi-matching 

is then clearly at least [ irMMl l since all vertices of A' have to be matched to its neighborhood. However, 



I 4 ' I 

it is also true that the maximal degree of a semi-matching equals [ irMMl l • ^ similar fact was used in 
[GKK12J for fractional matchings, and also in |KRT01j . For completeness, we are going to prove this fact 
in Lemma |4] This proof requires the following technical lemma, Lemma [3J 

Lemma 3. Let G == (A.B.E) be a bipartite graph and let A' C A such that \r{A')\ < \A'\. Then: 



VA"CA':^H^>I^(^')I 



degmaxsemi(A', B, i?) < [- 



l^'l 



rl- 



Proof. The proof is by contradiction. Let d = [ iX .L ] , S = semi(j4', B, E) and suppose that deg max S > 



\r(A')[ 
d+1. We construct now a set ^ C A' such that 



< ,\,, contradicting the premise of the lemma. 



\A\ - \A'\ 

To this end, we define two sequences {Ai)i with Ai C A' and (i?i)i with Bi C r{A'). Let b £ r{A') 
be a node with deg5(6) > d+1 and let Bi — {b}. We define 



A, = FsiB,), 



(1) 



This setting is illustrated in Figure[T| Note that all Ai and all Bi are disjoint. Let k be such that \Ak\ > 
and \Ak+i\ = 0. Then we set A = [j^^^ A,. 

By construction of the sequence {Bi)i, it is clear that for any b' e (JBi : degg{b') > deg5(6) — 1, 
since otherwise there is a degree-minimizing path from b to b' contradicting the definition of S. Then, 
by Equation fl] we obtain for all i that 1^4,1 > \Bi\{degg{b) — 1) which implies that \Ai\ > d\Bi\ since 
deg5(6) > d -I- 1. Remind that |Ai| > d -I- 1. We compute 



^2<i<k 



\B,, 



< 



1 + E 



2<i<k 



B 



\r{A)\ ^ |i?il + E 

|i| l^ll + E2<.<fe \M " (rf + 1) + E2<^<k 

and we obtain a contradiction to the premise of the lemma. 



^ 1 ^ \riA')\ 

^\B,\d d- \A'\ ■ 



D 



AI 



A2 




Bl = {b} 



-® edge in S 
o edge in E \ S 



Fig. 1. Illustration of the proof of Lemmalsl AH nodes b' € Ui>2 ^i have degg(6') > degg(6) — 1 since otherwise 
there is a degree-minimizing path. To keep the figure simple, only those edges oi E\ S are drawn that connect 
the Ai to Bi+i. Note that in general there are also edges outside S from Ai to IJ7<i ^j- However, there are no 
edges in the graph from Ai to IJ >j , j Bj. 



Lemma 4. Let G = {A, B, E) with \A\ = n, and let d = degmaxscmi(y4, B, E). Let A' be a subset of A 
with minimal expansion a, that is 

\r{A")\ 



A' = ar, 



and let a — ' i^/i • Then: 



■gmm- 

A"<ZA \A 

d= \a-^]. 



Proof. Wc show that d> \a ^~\ and d < \a ^~\ separately. 

1. d> \a^^~\: The set A' has to be matched entirely to vertices in its neighborhood. Therefore, there 
is a node b e r{A') with degree at least [ i^.^L ] = ["^"^1- 

2. d < \a~^~\: We construct a semi- matching explicitly with maximal degree d. Since an optimal 
semi-matching has at most this degree, the claim follows. 

Consider a decomposition of A into sets Ai,A2,... as follows. Ai C A is a set with minimal expansion, 
and for i > 1, At C A\ (Uj<i ^j) is the set with minimal expansion in G|(^\y . . Aj)x{B\r{\j . . Aj))- 
We construct a semi-matching S — Si U S2 ■ ■ ■ as follows. Firstly, match Ai to r{Ai) in 6*1. By 
Lemma 3 the maximal degree in Si is at most rT/TATrl — \'^^^^■ 
For a general Si, we match Ai to vertices in r{Ai) \ r{[J-^^ Aj). By Lcmmalsl the maximal degree 

in S, is at most r |r(A.)\rtu,<, A,)| 1 • 

This decomposition is illustrated in Figure [2] 

Furthermore, it holds 



\A, 



< 



\A 



Jj<i+i "^i) 



since if this was not true, then the set Ai U ^i+i would have smaller expansion in the graph 
G|(A\U A )x(B\r(U A )) than Ai. This implies that degmaxS* = degmaxS*! which in turn is 

D 



AI 



Bl 




-o edge in S 
o edge in E \ S 



Fig. 2. Illustration of the graph decomposition used in the proof of Lemma 14] Here, Bi is the set r{Ai) \ 
r{[J,^^Aj). The neighborhood of Ai in G is a subset of U7<i^i- f^i 'S'l however, Ai is matched entirely to 
vertices in Bi. 



We prove now that an optimal semi-matching is a 0(-y/n)-skeleton. 
Lemma 5. Let G == (^4, B, E) with n = \A\, and let S = semi(yl, B, E). Then: 

VA' C A : degmaxsemi(A',B, S") < ^n (degmaxsemi(A', _B, £;))^/^ + 1. 



Proof. Let A' <Z Ahe an arbitrary subset. Let A" 



\rs{A"')\ 
argmm^,„cA' \a'"\ ■ 
-\A"\- 



and let k = \rs{A")\. Let 



d = degmaxsemi(A', B, S). Then by Lemma 4l d = \—i^'\ . Furthermore, since A" is the set of minimal 
expansion in S, for all b G FsiA") : degg{b) — d, and hence \A"\ = kd. 

Let d* = degmaxsemi(A",B,£:). Then d* < degmaxsemi(yl', B,^), since A" C A'. It holds that 
Vx G rE{A") \ FsiA") : degs(a;) > d— I since otherwise there was a degree-minimizing path of length 2 
in S. Figure [3] illustrates this setting. The sum of the degrees of the vertices in rE{A") is upper-bounded 
by the number of A nodes. We obtain hence (|/£;(^")| — k){d — 1) + kd < n, and this implies that 
\rEiA")\ < ^. Clearly, d* > \A"\/\rEiA")l and using the prior upper bound on |/£;(^")| and the 



equality \A"\ = kd, we obtain d* > 



fcrf(d-l) 



which implies that d < 



1 for any fc > 1. 



D 



A" 



A\A" 




rs{A") 



-o edge in S 
■o edge in E \ S 



rE{A")\rs{A") 



Fig. 3. Illustration of the proof of Lemma [sl All nodes b £ Fsi^A") have degg(6) 
rE{A") \ rs{A") have degs(b) > d - 1. 



d, and all nodes b' £ 



In order to obtain an 0(n-'^/'^)-skeleton, for each a & A we add one edge to the 0(-yAi)-skeleton. 
Let S = se\m{A,B,E) be the 0(-y/ri)-skeleton, let B' — B{S) be the B nodes that are matched in the 
skeleton, and for all b ^ B' let Ai, = Fsib) be the set of A nodes that are matched to b in S. Intuitively, 
in order to obtain a better skeleton, we have to increase the size of the neighborhood in the skeleton of 
all subsets of A, and in particular of the subsets Af, for b e B' . We achieve this by adding additional 
optimal semi-matchings Sb — semi(A(,, B, E) for all subsets A^ with b <E B' to S, see Lemmals] We firstly 
prove a technical lemma, Lemma [6J that points out an important property of the interplay between the 
matchings S and the niatchings St for b G B' . Then, we state in Lemma ^ an inequality that is an 
immediate consequence of Holder's inequality. Lemma [7] is then used in the proof of Lemma [8j which 
proves that our construction is an 0(n^/^)-skeleton. 

Lemma 6. Let G = {A, B, E), A' C A, A" C A' , and let S = semi(A', B, E). Furthermore, let Fs{A') = 
{&!,..., bk}, and \/bi G Fs{A') : let A[ = Fs{b,) n A', and A'l = Fsih) (1 A". Then: 

degmaxsemi(A",B,^)-i ^ l^^'KI^I - 1) < l^'l- 
Proof. Let S" — seini{A",B,E), and denote d — degmaxS"'. Clearly, 

J2 degsib")<\A'\. (2) 

b"eB(S") 

Consider any b" E B{S"). We bound degg{b") from above as follows 

degsib") > maxJIA^I - 1 : 3a G A^' with 6" e FE{a)}. (3) 

Let j be such that \A'A — 1 poses the maximum of the set in the right hand side of Inequality [sl 
Note that if Inequality [3] was not true, then there would be a length two degree minimizing path m 
S connecting 6" and bj . The setup up visualized in Figure li] We bound now the right hand side of 
Inequality [3] as follows 

(I4|-l) = max{|yl'j-l : 3a e A'^ with b" e Fe (a)} 

^ E d^^i^-(-H)i-^)- (^) 

aers„ib") ^^ ^ ^ 

We used here that \A'g(^s(a))\ ^ l^jl ^°^ ^.ny a e rs"ib"), and \a e Fs"{b")\ = deggnib"). Since 
d = degmaxS"', and using Inequalities ^ and HI we obtain 



degsib")> ^ l(|^'^^^^^^^|_i) 



(5) 



aers„{b") 



We combine Inequalities [2] and [5] and the result follows 

|A'|> Yl deg5(n> E E i(I^B(5(a))|-l) 



6"eB(S") 



b"eS(S")aGrs„(fc") 

^EKii^:-ii- 



n 



A'j 




o o edge in S 

o- — -o edge in S" 

o node in A" 

• node in A' \ A" 



Fig. 4. Illustration of the proof of Lemma [m The degree of b" in S is at least \A'j\ — 1. Otherwise there would be 
a length two degree-minimizing path between b" and bj. 



In the proof of Lemma |8J we also need the following inequality. 
Lemma 7. Let xi, . . . ,Xk > 0, and let p > be an integer. Then: 

Proof. This is an immediate consequence of Holder's inequality: 



E-»<(E-n^/^fc' 



i=l 



i=l 



D 



Lemma 8. Let G = {A,B,E) be a bipartite graph with n — \A\. Let S = seim{A,B,E), and for all 
b e B{S) : Sb = semi{rs{b),B, E). Then: 

VA' C A :degmaxsemi(yl',B,S'U |J Sb) < [271^^^ degmaxsemi(A', B,^)]. 

beB(S) 

Proof Let A' C A. Let S = SuUfegsis) ^b- Let A" ^ argmin^,„cA. '^fjt'J''" and let k = |r^(A")|- From 
Lemma 4 it follows that degmaxsemi(A', B, S") = [■'-j-'-]- Furthermore, let d — degmaxsemi(yl", B, i?). 
For a node b e r§{A"), let A'^ = {a e A : S{a) = b}. For two nodes bi,bj e r§{A"), let A'^^,^, = {a G 
A" : S{a) = b,,Sb,ia) = bj}. 

We consider the cases k > n^l'^ and k < n^l'^ separately. 
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1. k > n^'^. Consider the semi-niatching S. From Lemma ^ we obtain the condition 

k 

l/dJ2\A':\{A,-l)<n, 



and since A" < Ai we obtain from the prior Inequahty that 

fc 



\ldY^i\P^l\-\f <n. 

Using Xli=i l^i'l — l^"l ^ii'i Lemma [TJ we obtain 

ll(|A"|-fcf<n, ^ 

\A!'\<\fndk^k. (6) 

Then, since dcgmaxsemi(A", i?, S*) = [ ^r-^l, we obtain from Inequahty 6 deg max semi (A", B, S") < 
\/t\ + 1- Since k > n^/^, we conclude that 

degmaxsemi(^",B,S') <n^/^Vd + 2. 

2. k < n^/^. We consider here the two subcases \A"\ < 2dk'^ and \A"\ > 2dk'^. 

(a) \A"\ < 2dP. Then since degmaxsemi(A", S, 5) = r^4rlj we conclude that 

degmaxsemi(A",B,S') < [2^^] < \2dn^'^}. 

(b) \A"\ > 2dk^. Let b e B{S) and consider the semi-matching Sh matching A'^ to B. From Lemmara 
and the fact that A'^ i,. ^ ^f, ft. we obtain 



k 

-Y,\A'LM\Al,J-l)<\A,\, 



1=1 

k 



By Lemma [71 we obtain 



k 
d 



j^Kf-jKI<l^.|. (7) 



Consider now the semi-matching S. From Lemma |6] we obtain the condition 

k 
d 



i^|A'/|(|A,|-l)<n. (8) 

j=i 

Using Inequality [7] in Inequality [8] and simplifying, we obtain 



d-^' " \'dk' " d' 



i— 1 i— 1 i — 1 

^ •Ar-:^\Ar-]\A"\<n. (9) 



d2fc3l I d2fcl I ^ 

■> V ' ^-v- 

11 



Since \A"\ > 2dk^, we can upper bound the terms / and // from Inequality ^ as follows 

1 



2d3fc3 



4d3fc4 

Using bounds 10 and 11_ in Inequality [9] and simplifying, we obtain 

1 



|A"|3 > /, and (10) 

^ \A"\^ > II. (11) 
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we conclude that 



Since degmaxsemi(^", i?, 5) — \—^A^, and using Inequality 

degmaxsemi(A",B,5) < \2^/^n^/^d'^/^]. 

Combining the bounds from cases 1, 2a and 2b, the result follows. D 

We mention that there are graphs for which adding further semi-matchings Sbib2 = semi(Abj;,2j B, E) 
to our 0(ni/3)-skeleton, where Ai,-^b2 is the set oi A vertices whose neighborhood in our 0(71^/3 )-skeleton 
is the set {6i, 62}, does not help to improve the quality of the skeleton. Before stating our main theorem. 
Theorem [3] we show in Lemma [9] that if Alice sends a c-matching skeleton, then Bob can compute a 
c + 1 approximation. Then, we state our main theorem. 

Lemma 9. Let G = {A,B,E) be a bipartite graph and let Ei,E2 be a partition of the edge set E. 
Furthermore, let E[ C Ei such that for any A' C A(Ei): 

degmaxsemi{A{Ei), B,E[) < cdegmaxsemi(A(iJi), i3,iJj). 

Then: 

degmaxsemi(A, i?, £'J U -E2) < (c + 1) degmaxsemi(yl, _B,_E). 

Proof. We construct a semi-matching S between A and B with edges from E[ U E2 explicitly and we 
show that degmaxS* < (c + 1) dcgmaxsemi(A, i?, i?). Since degmaxsemi(yl, i?, £'J U E2) < degmaxS, 
the result then follows. 

Let S2 ^ scmi{A, B,E) n E2, and let Si = semi(A \ A{S2),B,Ei). Then S = Si U S2. Clearly, 
degmax5'2 < deg max semi ( A, i3,i?). Furthermore, by the premise of the lemma we obtain deg max 6*1 < 
cdegmaxsemi(yl,i3,£^). Since degmaxS' < degmax5'i -l-degmaxS'2 and degmaxiSi -l-degmaxS'2 < 
(c + 1) degmax(yl, B, E) the result follows. D 

Theorem 3. Let G — {A,B,E) with n — \A\ and m — \B\. Then there are one-way two party deter- 
ministic communication protocols for the semi-matching problem, one with 

1. message size en log to and approximation factor n^''^ + 2, and another one with 

2. message size 2cn\ogm and approximation factor 2n^'^ + 2. 

Proof. Alice computes the skeletons as in Lemma[5]or in Lemma[8]and sends them to Bob. Bob computes 
an optimal semi-matching considering his edges and the edges received from Alice. By Lemma [9] the 
results follow. D 

4.2 Lovifer Bounds for Semi-matching-skeletons 

We present now a lower bound that shows that the skeletons of the previous subsection are essentially 
optimal. For an integer c, we consider the complete bipartite graph Kn^m where ttt. is a carefully chosen 



value depending on c and n. We show in Lemma 10 that for any subset of edges E' of Kn,,n such that 



for all a ^ A : deg^,{a) < c, there is a subset A' C A with 1^4' | < m such that an optimal semi-matching 
that matches A' using edges in E' has a maximal degree of f2{n^+^). Note that since \A'\ < m, there is a 
matching in Kn^m that matches all A' vertices. This implies that such an E' is only an J7(n^+T)-skeleton. 

12 



Lemma 10. Let G = {A,B,E) be the complete bipartite graph with \A\ — n and \B\ = (c!) <:+in<^+i for 
an integer c. Let E' (- E be an arbitrary subset such that Va € A : deg^, (a) < c. Then there exists an 
A' C A with \A'\ < \B\ and 



(c!) 



e + l 



degmaxsemi(A',B,£;') > -^^^^^^^^ n^^+i > e ^■^n<^+i. (13) 

c 

Proof. Let E' C E he as in the statement of the lemma. Let E" be an arbitrary superset of E' such that 
Va G A : deg^„{a) — c. Since degmaxsemi(74', B, E") < degmaxsemi(A', B, E') it is enough to show the 
lemma for E" . Denote by ^{ii,...,i^} the subset of A such that Va £ ^{ii,...,i^} : rE"{a) — {bi^, . . . ,6^^}. 
Then 

m= E l^'l' (14) 



Ai:z— {ii ,...,ic} and 
{6i, ,...,fci^ } is a c-subsct of B 



since Va G A : deg^/, (a) = c. Suppose for t he s ake of a contradiction that Inequality 13 



is not true. 



Then for all At on the right side of Inequality 14 we have \Ai\ < (c!) '=+in<^+i . There are at most (' ') 
such sets. This implies that: 

,., f\B\\ ,„i 1 \BW „ 1 1 (c!)^n*,„i 

L4 < ' ' • (cl)^^n^^ < ■L-L(c!)s+T„s+T < ^^ — . (c!)^^n^^ = n. 

\ c I c! c! 



This is a contradiction to the fact that \A\ > n and proves the first inequality in Inequality 13 To proof 
the second, we apply Stirling's formula, and we obtain 

(C!)^+T (v^C''+-^/^e~'')^'+T l/21n(2^)-l/21n(c)-c 

^^^ > ^ = e =+1 

c c 

It can be shown that for any c > 0, — — '^ c+i — ' > —1-3 which proves the result. D 

We extend Lemma [lO] now to edge sets of bounded cardinality without restriction on the maximal 
degree of an A node, and we state then our lower-bound result in Theorem |4] 

Lemma 11. Let c > be an integer, let e > be a constant, and let c' = (1 + e)c. Let G — {A,B,E) 

1 1 

be the complete bipartite graph with \A\ — n and \B\ — (c'!)<:'+i (yx^ ■ n)''+'^ . Let E' C- E be an arbitrary 

subset of size at most c ■ n. Then there exists an A' Q A with \A'\ < \B\ and 

degmaxsemi(A',i?, £;') > e"^-^(- n)s^+i. (15) 

Proof. Split A into A> and v4< such that for all a G Ay : deg^/ (a) > c', and for all a E A< : dcgg/ (a) < c'. 



Then |A>|c' + |A<| < en which implies that |^<| > Yrr^n. Let G" = G\a^xb- Then by Lemma 10 applied 
on G" there is a subset A' C A< with \A'\ < \B\ such that 

degmaxsemi(A',B, £''U^xb) > e"^-^|A<|^'+T, 
and since degmaxsemi(A', i?,i?'|^^xB) — degmaxsemi(yl'. i?,_E'), the result follows. D 

Theorem 4. Let c > be an integer. Then for all e > 0, an 0(n(i+^)<=+i ) -semi-matching skeleton requires 
at least en edges. 

4.3 One-v^ray, two party communication lower bound 

To prove a lower bound on the deterministic communication complexity we define a family of bipartite 
graphs. For given integers n and m, let Gi = {Gi{x)\x G {0,1}"^'"} be defined as follows. Let Bq = 
{&?,..., 6^}, Bi = {6},...,6^}andA = {ai,...,a„}. Given x G {0, l}"x™, let S, = {(a„&;-0|l < 
i < n,l < j < m} (i.e, the entries of the matrix x determine if there is an edge {ai,b'^) or an edge 
{ai,b^A for all i,j). Then, we define Gi{x) — {A,Bo U Bi,Ex). From Lemmaflllwe immediately obtain 
the following lemma. 
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Lemma 12. Let c > be an integer, let e > be a constant, and let c' = (1 + e)c. Let n be a sufficiently 
large integer, and let m = (c'!) <:'+i (ttt • n) ='+i . -Lei G = (A, _Bo U Bi,E) be a graph G G CJi, and 
let E' C E be such that \E'\ < en. Then there exists a set of nodes A' C A with \A'\ < m and 
degniaxsemi(A',BoUBi,£:') > l/2e-^-^{r^n)^ . 

We further define a second family of bipartite graphs G2 on the sets of nodes A and C, \A\ — \C\ = n. 
For a set A' C A we define the graph G2{A') to be an arbitrary matching from all the nodes of A' to 
nodes of C The family of graphs G2 is defined as G2 — {G2{A')\A' C A}. 

Our lower bound will be proved using a family of graphs G . Slightly abusing notation, the family of 
graphs G is defined as G = Gi^G2- That is, the graphs in G are all graphs G — {A, BqU BiU C, E1UE2) 
built from a graph Gi = {A, BqU Bi,Ei) g Gi and a graph G2 = {A, G,Ei) £ G2 where the set of nodes 
A is the same for Gi and G2. We now prove our lower bound. 

Theorem 5. Let c > be an integer and let e > be an arbitrarily small constant. Let V be a /3- 
approximation one-way two-party protocol for semi matching that has communication complexity at most 
a. If P < J = l/2^^{-^n) (i+f)<:+i , then a > en, where n is the number of nodes to be matched. 

1 1 

Proof. Take n sufficiently large. Let c' = (1 + e)c and let m — (c'!)^'+i(y^ • n)^'+'^. We consider as 

possible inputs the graphs in G (for n and m). Given an input graph, Alice will get as input all edges 

between A and Bq U Bi (i.e., a graph in Gi) and Bob will get all edges between A and G (i.e., a graph 

in G2) 

Assume towards a contradiction that the communication complexity of V is at most en. Then there 

is a set of graphs G* '^ Gi, \G*\ > 2"™"'^", such that on all graphs in G* Alice sends the same message 

to Bob. Consider the set X* C {0, 1}"><™ such that G* = {Gi{x) \ x £ X*}, Since there is a one-to-one 

correspondence between G* and X* , \X*\ > 2"™"'^", and there are at most en entries which are constant 

over all matrices in X* , otherwise \X*\ < 2"'""'=". This means that there are at most en edges that exist 

in all graphs in G* ■ Let E' be the set of all these edges. 



Consider now the graph G — (A, BoUBi,E'). Since \E'\ < en, by Lemma 12 there exists a set A' C A 
with \A'\ < m and degmaxsemi(A', Bq U Bi,E') > 7. We now define G^ G G2 to be G^ = G2{A\A'). 

Now observe that on any ofGe G* x {G2} C G, V gives the same output semi-matching S. S can 
include, as edges matching the nodes in A' , only edges from £", since for any other edge there exists an 
input in G* x {G2} in which that edge does not exist and V would not be correct on that input. It follows 
(by Lemma |12[ ) that the maximum degree of S is greater than 7. On the other hand, since \A'\ < m, 
there is a perfect matching in any graph in G* x {G2}. The approximation ratio of V is therefore greater 
than 7. A contradiction. D 

5 The Structure of Semi-Matchings 

We now present our results concerning the structure of semi-matchings. Firstly, we show in Lemma [T3| that 
a semi-matching that does not admit length 2 degree-minimizing paths can be decomposed into maximal 



matchings. In Lemma 14 we show that if a semi-matching does not admit any degree-minimizing paths, 
then there is a similar decomposition into maximum matchings. 

Lemma [13] is then used to prove that semi-matchings that do not admit length 2 degree- minimizing 
paths approximate optimal semi-matchings within a factor [log(n -|- 1)] . To this end, we firstly show in 
Lemma [l5] that the first d* maximal matchings of the decomposition of such a semi-matching match at 
least 1/2 of the A vertices, where d* is the maximal degree of an optimal semi-matching. In Theorem [6J 
we then apply this result [log(7i -I- 1)] times, showing that the maximal degree of a semi-matching that 
does not admit length 2 degree-minimizing paths is at most [log(n -|- 1)] times the maximal degree of an 
optimal semi-matching. 

Lemma 13. Let S = serm2{A,B,E) be a semi-matching in G that does not admit a length 2 degree- 
minimizing path, and let d = degmaxS". Then S can be partitioned into d matchings Afi, . . . , M^ such 
that 

Vi : Mi is a maximal matching in G\AixBi, 

where Ai ^ A, Bi ^ B, and for i > I : Ai ^ A\ Ui<j<i M^j) and Bi = B(Af,_i). 
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Proof. The matchings Mi, . . . , Md can be obtained as follows. For each h S B{S), label its incident edges 
in S* by 1, 2, ... , deg5(6) arbitrarily. Matching Mi is then the subset of edges of S that are labeled by i. 
We prove the statement by contradiction. Let i be the smallest index such that Mi is not maximal in 
G\AixBi- Then there exists an edge e — ab £ E with a £ Ai and b G Bi such that Mi U {e} is a matching 
in G\AixBi- Note that degg{b) < i since b is not matched in Af^. Consider now the edge e' G S matching 
the node a to b' in S. Since a G Ai and a is not matched in M^, e' is in a matching Mj with j > i and 
hence deg5(6') > j > i- Then P = {b',a,b) is a length 2 degree-minimizing path since deg5(6') > i and 
deg5(6) < i contradicting our assumption. D 

Lemma 14. Let S* — semi{A,B,E) be a semi-matching in G that does not admit degree-minim,izing 
paths of any length, and let d* = deg max S* . Then S* can be partitioned into d* matchings Mi , . . . , Md* 
such that 

Vi : Mi is a maximum matching in G\AixBi, 

where Ai = A, Bi ^ B, and fori>l:Ai=A\ Ui<j<i M^j) and Bi = B{Mj^i). 

Proof. The proof is similar to the proof of Lemma [TSJ The matchings A/i , . . . , Md- can be obtained as 
follows. For each b G B{S), label its incident edges in 5 by 1, 2, ... , deg^. (6) arbitrarily. Matching Mi is 
then the subset of edges of S that are labeled by i. 

We prove the statement by contradiction. Let i be the smallest index such that Mi is not a maximum 
matching in G\AixBi- Then there exists an augmenting path A = (ai,&i, . . .a;, 6;) such that for all 
j < I : {aj+i,bj) G Mi and Vi : {ai,bi) ^ Mi. Let b' be the match of ai in S* . Since ai G Ai, 
degg. (6') > i. Since bi G Bi and 6/ is not matched in M* , deg^. (5/) < i. Then P — (6', ai,bi, . . . ,ai,bi) 
is a degree-minimizing path contradicting our assumption. D 

We firstly prove a lemma that is required in the proof of Theorem [6] 

Lemma 15. Let A' C A, let S — seim2{A' , B, E) be a semi-matching in G\a'xb that does not admit 
length 2 degree-minimizing paths and let S* — seim(A' , B , E) be an optimal semi-matching in G\a'xb- 
Then 3 A" C A' with \A"\>1/2\A'\ such that 

1. degmaxS'|yi"xs < degmaxS"*, 

2. S\a'\a"xB is a semi-matching of G\a'\a"xB '^i^d it does not admit length 2 degree-minimizing paths. 



Proof. Let d — deg max S and let d* = deg max S* . Partition S into matchings Mi , . . . , Md as in 
Lemma [Tsj We will show that A" = Ut<d* M^i) fulfills Item [T| and Iteml2|of the Lemma. 

We have to show that \A"\ > 1/2|^'|. Let A"' ^ A' \ A" and let (a, 5j e S* be an edge such that 
a G A'" . We argue now, that degg(6) > d* . 

Suppose for the sake of a contradiction that deg^(6) < d* . Then (a, b) could have been added to some 
matching Mj with j < d* . Since by Lemma 13 all Mi are maximal, we obtain a contradiction and this 



proves that degg{b) > d* . 

This implies further that \A"\ > d* ■ \B{S*\a"'xb)\ > d* ■ \A'"\/d* = |A"'|, where the last inequality 
comes from the fact that a node b G B{S*\a"'xb) has at most d* edges incident in S* . Since A'" and A" 
form a partition of A', we obtain \A"\ > 1/2\A'\. 

Since A" — A{S\a"xb) and S\a"xb is a set of d* matchings, Item [I] is trivially true. Concerning 
Item pi note that if <S'|^'\^"xs admitted a length 2 degree-minimizing path, then that path would also 
be a degree-minimizing path in S contradicting the premise that S does not admit a length 2 degree- 
minimizing path. D 

Theorem 6. Let S — serm2{A,B,E) be a semi-matching of G that does not admit a length 2 degree- 
minimizing path. Let S* be an optimal semi-matching in G. Then: 

deg max S* < [log(n -I- 1)] deg max S'*. 

Proof. Wc construct a sequence of vertex sets (Ai) and a sequence of semi- matchings {Si) as follows. Let 
Ai = A, and let Si = S. For any i, Si will be a semi-matching in the graph G\AixB and it will not admit 
length 2 degree-minimizing paths. 
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We construct A^+i and Si+i from Ai and Si as follows. By Item IT] of Lemma 15 there is a subset 
A'^ C Ai of size at least l/2|^i| such that Si\A'.xB has maximal degree d* . Let Ai+i = Ai\ A[, and let 
Si+i = SjIa.+ixs- By Item[2]of Lemma 
the graph G\a,+^ 
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S'i+i does not comprise length 2 degree-minimizing paths in 



;S- We stop this construction at iteration 



when A', 



Ai occurs. 

Note that -S* = IJ^ "Sil^/xs and hence degmax5 < J2i=i deg max S'i | ^' x s ^ I • d* . It remains to argue 

that / < log(n) + 1. SinceVil > l/2|^i| and A^+i = ^^ \ A'^, we have \A,+i\ < 1/2\A,\. Since \Ai\ = n, 



we have \A^\ < {^Y ^n. Then, |Apiog(„+i)^ 
I £ riog(n + 1)1 , which proves the theorem. 



< 1 which implies that |^riog(iH 



1)1 



= 0. We obtain hence 

D 
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